208 research outputs found

    Evaluating the Usability of Automatically Generated Captions for People who are Deaf or Hard of Hearing

    The accuracy of Automated Speech Recognition (ASR) technology has improved, but it is still imperfect in many settings. Researchers who evaluate ASR performance often focus on improving the Word Error Rate (WER) metric, but WER has been found to have little correlation with human-subject performance on many applications. We propose a new captioning-focused evaluation metric that better predicts the impact of ASR recognition errors on the usability of automatically generated captions for people who are Deaf or Hard of Hearing (DHH). Through a user study with 30 DHH users, we compared our new metric with the traditional WER metric on a caption usability evaluation task. In a side-by-side comparison of pairs of ASR text output (with identical WER), the texts preferred by our new metric were preferred by DHH participants. Further, our metric had significantly higher correlation with DHH participants' subjective scores on the usability of a caption, compared with the correlation between the WER metric and participants' subjective scores. This new metric could be used to select ASR systems for captioning applications, and it may be a better metric for ASR researchers to consider when optimizing ASR systems. Comment: 10 pages, 8 figures, published in ACM SIGACCESS Conference on Computers and Accessibility (ASSETS '17)
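
    As a rough illustration of why two ASR outputs with identical WER can differ in usability, the sketch below computes WER as the word-level edit distance divided by the reference length. This is the standard WER definition, not the new metric proposed in the abstract; the function name and the example sentences are assumptions made up for illustration.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the number of reference words."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen so far
    # and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution_cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,                      # deletion
                curr[j - 1] + 1,                  # insertion
                prev[j - 1] + substitution_cost,  # match or substitution
            )
        prev = curr
    return prev[-1] / len(ref)


# Hypothetical example sentences: each hypothesis has exactly one substitution.
reference = "the meeting is not at noon"
hyp_a = "the meeting is now at noon"  # loses the negation
hyp_b = "a meeting is not at noon"    # changes only an article

wer_a = word_error_rate(reference, hyp_a)
wer_b = word_error_rate(reference, hyp_b)
print(wer_a, wer_b)
```

    Both hypotheses receive the same WER, yet the first reverses the meaning of the sentence while the second barely changes it. That gap between error count and reader impact is what a captioning-focused metric would need to capture.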

    Multilingual Word Sense Induction to Improve Web Search Result Clustering

    In [12], a novel approach to Web search result clustering based on Word Sense Induction (WSI), i.e., the automatic discovery of word senses from raw text, was presented; key to the proposed approach is the idea of, first, automatically inducing senses for the target query and, second, clustering the search results based on their semantic similarity to the induced word senses. In [1] we proposed an innovative Word Sense Induction method based on multilingual data; key to our approach was the idea that a multilingual context representation, where the context of a word is expanded by considering its translations in different languages, may improve the WSI results; the experiments showed a clear performance gain. In this paper we give some preliminary ideas on applying our multilingual Word Sense Induction method to Web search result clustering.

    Beyond Textual Issues: Understanding the Usage and Impact of GitHub Reactions

    Recently, GitHub introduced a new social feature, named reactions, which are "pictorial characters" similar to the emoji symbols widely used nowadays in text-based communications. In particular, GitHub users can use a pre-defined set of such symbols to react to issues and pull requests. However, little is known about the real usage and impact of GitHub reactions. In this paper, we analyze the reactions provided by developers to more than 2.5 million issues and 9.7 million issue comments, in order to answer an extensive list of nine research questions about the usage and adoption of reactions. We show that reactions are being increasingly used by open source developers. Moreover, we also found that issues with reactions usually take more time to be handled and have longer discussions. Comment: 10 pages

    Second-Order Belief Hidden Markov Models

    Hidden Markov Models (HMMs) are learning methods for pattern recognition. Probabilistic HMMs have been among the most widely used techniques based on the Bayesian model. First-order probabilistic HMMs were adapted to the theory of belief functions such that Bayesian probabilities were replaced with mass functions. In this paper, we present a second-order Hidden Markov Model using belief functions. Previous work on belief HMMs has focused on first-order HMMs. We extend them to the second-order model.

    The Relationship Between Plasma Flow Doppler Velocities and Magnetic Field Parameters During the Emergence of Active Regions at the Solar Photospheric Level

    A statistical study has been carried out of the relationship between plasma flow Doppler velocities and magnetic field parameters during the emergence of active regions at the solar photospheric level with data acquired by the Michelson Doppler Imager (MDI) onboard the Solar and Heliospheric Observatory (SOHO). We have investigated 224 emerging active regions with different spatial scales and positions on the solar disc. The following relationships for the first hours of the emergence of active regions have been analysed: i) of peak negative Doppler velocities with the position of the emerging active regions on the solar disc; ii) of peak plasma upflow and downflow Doppler velocities with the magnetic flux growth rate and magnetic field strength for the active regions emerging near the solar disc centre (the vertical component of plasma flows); iii) of peak positive and negative Doppler velocities with the magnetic flux growth rate and magnetic field strength for the active regions emerging near the limb (the horizontal component of plasma flows); iv) of the magnetic flux growth rate with the density of emerging magnetic flux; v) of the Doppler velocities and magnetic field parameters for the first hours of the appearance of active regions with the total unsigned magnetic flux at the maximum of their development. Comment: 14 pages, 8 figures. The results of the article were presented at the ESPM-13 (12-16 September 2011, Rhodes, Greece), Abstract Book p. 102-103, P.4.13, http://astro.academyofathens.gr/espm13/documents/ESPM13_abstract_programme_book.pdf

    Structural Invariance of Sunspot Umbrae Over the Solar Cycle: 1993-2004

    Measurements of maximum magnetic flux, minimum intensity, and size are presented for 12 967 sunspot umbrae detected on the NASA/NSO spectromagnetograms between 1993 and 2004 to study umbral structure and strength during the solar cycle. The umbrae are selected using an automated thresholding technique. Measured umbral intensities are first corrected for a confirming observation of umbral limb-darkening. Log-normal fits to the observed size distribution confirm that the size spectrum shape does not vary with time. The intensity-magnetic flux relationship is found to be steady over the solar cycle. The dependences of umbral size on the magnetic flux and minimum intensity are also independent of cycle phase and give linear and quadratic relations, respectively. While the large sample size does show a low amplitude oscillation in the mean minimum intensity and maximum magnetic flux correlated with the solar cycle, this can be explained in terms of variations in the mean umbral size. These size variations, however, are small and do not substantiate a meaningful change in the size spectrum of the umbrae generated by the Sun. Thus, in contrast to previous reports, the observations suggest that the equilibrium structure, as testified by the invariant size-magnetic field relationship, as well as the mean size (i.e. strength) of sunspot umbrae do not significantly depend on solar cycle phase. Comment: 17 pages, 6 figures. Published in Solar Physics

    Resolving the infinitude controversy

    A simple inductive argument shows natural languages to have infinitely many sentences, but workers in the field have uncovered clear evidence of a diverse group of ‘exceptional’ languages, from Proto-Uralic to Dyirbal and, most recently, Pirahã, that appear to lack recursive devices entirely. We argue that in an information-theoretic setting non-recursive natural languages appear neither exceptional nor functionally inferior to the recursive majority.

    Higher dietary flavone, flavonol, and catechin intakes are associated with less of an increase in BMI over time in women: a longitudinal analysis from the Netherlands Cohort Study

    BACKGROUND: Dietary flavonoids are suggested to have antiobesity effects. Prospective evidence of an association between flavonoids and body mass index (BMI) is lacking in general populations. OBJECTIVE: We assessed this association between 3 flavonoid subgroups and BMI over a 14-y period in 4280 men and women aged 55-69 y at baseline from the Netherlands Cohort Study. DESIGN: Dietary intake was estimated at baseline (1986) by a validated food-frequency questionnaire. BMI was ascertained through self-reported height (in 1986) and weight (in 1986, 1992, and 2000). Analyses were based on sex-specific quintiles for the total intake of 6 catechins and of 3 flavonols/flavones. Linear mixed effect modeling was used to assess longitudinal associations in 3 adjusted models: age only, lifestyle (age, energy intake, physical activity, smoking status, alcohol intake, type 2 diabetes, and coffee consumption), and lifestyle and diet (vegetables, fruit, fiber, grains, sugar, dessert, and dieting habits). RESULTS: After adjustment for age and confounders, the BMI (kg/m²) of women with the lowest intake of total flavonols/flavones and total catechins increased by 0.95 and 0.77, respectively, after 14 y. Women with the highest intake of total flavonols/flavones and total catechins experienced a significantly lower increase in BMI of 0.40 and 0.31, respectively (between group difference: P < 0.05). This difference remained after additional adjustment for dietary determinants and after stratification of median baseline BMI. In men, no significant differences in BMI change were observed over the quintiles of flavonoid intake after 14 y. CONCLUSION: Our results suggest that flavonoid intake may contribute to maintaining body weight in the general female population. Author: Hughes, Laura A E (Netherlands Cohort Study). Published in The American Journal of Clinical Nutrition (Am J Clin Nutr), United States

    Positive words carry less information than negative words

    We show that the frequency of word use is determined not only by the word length (Zipf 1935) and the average information content (Piantadosi 2011), but also by its emotional content. We have analyzed three established lexica of affective word usage in English, German, and Spanish, to verify that these lexica have a neutral, unbiased, emotional content. Taking into account the frequency of word usage, we find that words with a positive emotional content are more frequently used. This lends support to the Pollyanna hypothesis (Boucher 1969) that there should be a positive bias in human expression. We also find that negative words contain more information than positive words, as the informativeness of a word increases uniformly as its valence decreases. Our findings support earlier conjectures about (i) the relation between word frequency and information content, and (ii) the impact of positive emotions on communication and social links. Comment: 16 pages, 3 figures, 3 tables